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Abstract 

We consider the lambda-coalescent processes with positive frequency of singleton 
clusters. The class in focus covers, for instance, the beta(a, 6)-coalescents with 
a > 1. We show that some large-sample properties of these processes can be derived 
by coupling the coalescent with an increasing Levy process (subordinator), and 
by exploiting parallels with the theory of regenerative composition structures. In 
particular, we discuss the limit distributions of the absorption time and the number 
of collisions. 

Keywords: absorption time, coupling, lambda-coalescent, number of collisions, regenera- 
tive composition structure, subordinator. 

1 Introduction 

The lambda-coalescent with values in partitions of n integers is a Markovian process 
LT n = (U n (t)) t >o which starts at t = with n singletons and evolves according to the rule: 
for each t > when the number of clusters is m, each k tuple of them is merging in one 
cluster at probability rate 

X m ,k= [ x k (l-x) m - k u(dx), 2<k<m, (1) 
Jo 

where v is a measure on the unit interval with finite second moment. The integral repre- 
sentation of rates (CQ) ensures that the processes U n can be defined consistently for all n, 
as restrictions of a coalescent process LToo which starts with infinitely many clusters and 
assumes values in the set of partitions of N, see [23]. The infinite coalescent LToo may be 
regarded as a limiting form of LT n as n — > oo, and uniquely connected to a process with val- 
ues in the infinite-dimensional space of partitions of a unit mass. The lambda-coalescents 
were introduced in the papers by Pitman [23] and Sagitov [25], where the parameteriza- 
tion by finite measure A(dx) = x 2 i>(dx) was used. The reader is referred to the recent 
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lecture notes [21 H] for accessible introduction in the theory of lambda-coalescents and a 



After some number of collisions (merging events) U n enters the absorbing state with 
a sole cluster. Two basic characteristics of the speed of the coalescence are the absorption 
time r n and the number of collisions X n . The large- n properties of r n and X n are strongly 
determined by the concentration of measure v on the unit interval near the endpoints of 



We suppose that v has no mass at 1, which excludes forced termination of Iloo at 
independent exponential time. The coalescent is said to come down from infinity if Il^it) 
has finitely many clusters for each t > almost surely; then r n converge to a finite random 
variable which is the absorption time of Iloo. Otherwise, n^t) almost surely stays 
with infinitely many clusters for all t. There is a delicate criterion in terms of the rates 
A m> fc to distinguish between the two alternatives [26J. 

In this paper we shall study r n and X n under the assumption that Iloo stays infinite due 
to infinitely many original clusters which do not engage in collisions before any given time 
t > 0. This family of lambda-coalescents can be characterized by the moment condition 



We call the collection of singleton clusters of rio^t) the dust component. The dust compo- 
nent has a positive total frequency, meaning that the number of singletons within Tl n (t) 
grows approximately linearly in n as n — > oo. 

The coalescents with dust component do not exhaust all coalescents which stay infinite. 
One distinguished example is the Bolthausen-Sznitman coalescent with u(dx) = x~ 2 dx 
which stays infinite although (j2J) fails. Such examples on the border between 'coming down 
from infinity' and 'possessing dust component' are more of an exception if one considers 
e.g. measures v satisfying a condition of regular variation near zero. 

Under (T5]) every transition of Iloo will involve infinitely many singletons. This suggests 
that most of the collision events of II n will involve some of the original n clusters, for large 
n. Another way to express this idea is to say that in a tree representing the complete 
merging history of IT n , most of the internal nodes are linked directly to one of n leaves. 
We will show that this intuition is indeed correct, to the extent that the behaviour of r n 
and X n can be derived from that of analogous quantities associated with the evolution of 
the dust component. In turn, the total frequency of the dust component of Iloo undergoes 
a relatively simple process, which may be represented as exp(— St) where S = (S t )t>o 
is a subordinator. Similarly for n n , the engagement of original n clusters in their first 
collisions follows a Markovian process which has been studied in the context of regenerative 
composition structures derived from subordinators [13]. A coupling of Iloo with S will 
enable us to apply known results about the level-passage for subordinators, and about 
the asymptotics of regenerative composition structures. 

The connection between Iloo and S was first explored in [TTJ in the special case when 
v is a finite measure, hence subordinator S is a compound Poisson process. While in the 
present paper we are mainly interested in infinite u, the case of finite v is not excluded. 
Moreover, we will be able to extend the results of [11] by removing a condition on v 
imposed in that paper. 
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In a recent paper by Haas and Miermont [TS] results on counting collisions in the 
coalescent and counting blocks in the regenerative composition were derived separately 
in the context of absorption times of decreasing Markov chains. Our approach adds some 
insight to the connection between these two models, and it entails some delicate features 
like differentiating between collisions which involve some original clusters of n n and the 
collisions which do not. 

The possible modes of behaviour of r n and X n for large n are best illustrated by the 
family of coalescents driven by a beta measure 

u(dx) = cx a -' 3 (l - x) fe_1 dx, a,b,c>0. (3) 

These coalescents come down from infinity for a < 1 and stay infinite for a > 1. With 
the account of results of the present paper, we have the following list. 

(i) Case < a < 1. The limit law of (X n - (1 - a)n) /n 1 ^ 2 '^ is (2 - a)-stable (see [IS], 
and [21J for the case 6 = 1). The distribution of is unknown. 

(ii) Case a = 1. The instance b = 1 is the Bolthausen-Sznitman coalescent, for which 
the limit distribution of r n — log log n is standard Gumbel [HE], while X n , suitably 
scaled and centered, converges weakly to a 1-stable distribution [TJ [2D]- The case 
b 7^ 1 remains open. 

(iii) Case 1 < a < 2. In the sequel we show that (r„ — c\ogn) / (logn) 1 ! 2 (with suitable 
c > 0) converges weakly to a normal distribution, and that X n /n 2 ~ a converges 
to the exponential functional of a subordinator. The result about X n was proved 
previously in [18], and in [21] in the case 6=1. 

(iv) Case a > 2. In the present paper we prove that normal limits hold for both r n 
and X n with explicitly determined scaling and centering. In the case a > 2 these 
asymptotics were previously shown in [11] . In the case a = 2 the result for X n was 
derived in [T5] . 

2 The coalescent and singleton clusters 

In the role of the state space of the coalescent IT n with initially n clusters we take the 
set of partitions of [n] := {l,...,n}, in which every singleton cluster is classified as 
either primary or secondary. Under the dust component of H n (t) we shall understand 
the collection of primary clusters. Every nonsingleton cluster of Il n (t) is regarded as 
secondary. For the notational convenience the clusters are written by increase of their 
minimal elements, the elements within the clusters are written in increasing order, and the 
secondary clusters are written in brackets. For instance, 1 (2) (3 5 6) 4 7, a partition of the 
set [7], has three primary clusters and two secondary: 1, 4, 7 and (2), (3 5 6), respectively. 

Introduce A mi i as in ([1]) with k = 1. We have A mj i < oo by assumption ([2]). 

We define the lambda-coalescent IT n as a cadlag Markov process with values in such 
partitions of [n] and the initial state 12 • • • n with n primary clusters. Each admissible 
transition is either merging some clusters in one cluster, or turning a primary singleton 
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cluster into secondary. From partition with m clusters, the transition rate for merging 
each particular fc-tuple of m clusters in one is \ m ,k (2 < k < m), and the transition rate 
for turning each particular primary singleton cluster into secondary singleton cluster is 
\ m ,i- For instance, the sequence of distinct states visited by n 7 could be 



Let N n (t) be the number of clusters in H n (t). Then N n = (N n (t))t>o is a nonincreasing 
Markov process, with the transition rate 



for jumping from m to m — k + 1, for 2 < k < m. Turning a primary singleton cluster 
into a secondary singleton cluster does not cause a jump of N n . The absorption time of 
Il n can be recast as r n = inf{t : N n (t) = 1}, and the number of collisions X n is equal 
to the number of jumps the process N n needs to proceed from n to 1 (which is 4 in the 
above example where the second transition does not alter the number of clusters). 

Removing element n transforms partition of [n] into partition of [n — 1]. For example, 
partitions 1 (2 4) (3), 1 (2) 3 4 and 1 (2) 3 (4) all become 1 (2) (3). Restricting in this 
way Il n to [n — 1], pointwise in t > 0, yields a stochastic copy of H n -i- This follows as 
in [23] since the rates satisfy the recursion X m ^ = \ m +i,k + ^m+i,k+i for all 1 < k < m. 
Therefore we may define IT n on the same probability space consistently for all n. Explicit 
realization will appear in the sequel. 

The projective limit of the processes n n , n 6 N, is a Markov process U^, starting at 
t = with the infinite configuration of primary clusters 12 ■ ■ ■ , and assuming values 
in the space of partitions of the infinite set N. Each partition Iloo^) has only primary 
singletons, namely those original clusters which do not engage in collisions up to time t. 
For a generic singleton, e.g. labelled 1, the time before its first collision has exponential 
distribution with parameter Ai i, and when such collision occurs infinitely many other 
clusters are engaged. 

The differentiation of singletons of H n {t) into primary and secondary becomes trans- 
parent by considering Il n as restriction of Iloo on [n\. The secondary singletons of Il n (t) 
are the unique representatives in [n] of some infinite clusters of n oo (t). The primary 
singletons of H n (t) are also singletons in the partition n oo (t). 

There is a construction of based on a planar Poisson point process in the strip 
[0,1] x [0, oo) with intensity measure v{dx) x dt, see [2j HJ 123]. With each atom (t,x) 
one associates a transition of Iloo performed by tossing a coin with probability x for 
heads. To pass from n oo (t— ) to n oo (t), the coin is tossed for each cluster of n oo (t— ), then 
those clusters marked heads are merged in one, while the clusters marked tails remain 
unaltered. Although there are infinitely many transitions within any time interval if v 
is an infinite measure, condition ([2]) ensures that Iloo does not terminate. In the case of 
finite v transitions of Iloo occur at the epochs of Poisson process with rate ^([0, 1]). 

Let N*(t) be the number of primary clusters in H n (t). By homogeneity properties of 
n„, the process N* = (N*(t)) t > is a nonincreasing Markov process, jumping at rate tp m .k 
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from m to m — k for 1 < k < m. Let 

r* n := infO : N* n {t) = 0} 

be the random time when the last of n primary clusters disappears. For 1 < r < n, 
let K n>T be the number of decrements of size r of (N*) on the way from n to 0, let 
K n := Y^=x^n,r be the total number of decrements of (N*), and let X* be the number 
of non-unit decrements of (N*). Obviously, 

X* n = K n -K nA . (5) 

We call the clusters of partition IT*(t*) that remain at time r* residual, and we denote 
R n the number of residual clusters. 

Processes N n and N* look very similar, thus at a first glance it might seem surprising 
that N* is much easier to handle. The simplification comes from the identification of the 
sequence of decrements of N* with the nth level of a regenerative composition structure 
[T3] , and further connection to the range of a subordinator. The main new contribution 
of the present paper is that N* yields a good approximation for N n for large n, thus X* 
and r* are close to their counterparts X n and r n . In one direction, the connection is quite 
obvious: 

X* n <X n , K(t)<N n (t), <<r n . 

For instance, the first inequality holds since every collision taking at least two primary 
clusters contributes to X n , and since with positive probability some R n > 2 clusters 
remain at time r* when the last primary clusters disappears. 

3 Coupling with a subordinator 

Condition <^ implies that there exists a pure-jump subordinator 5* 
Laplace transform 

E(e- zSt ) = e-^ z \ z>0, 
where the Laplace exponent is given by 

:=!(!- (1 -x) z )u(dx). 
Jo 

The coalescent process will be represented in terms of passage of S through multiple 
exponentially distributed levels. We describe first the evolution of the dust component. 
Let ei, . . . , e n be independent of S i.i.d. standard exponential random variables, and 

let 

£n-.n < ■ ■ • < £n:i be their order statistics. It is not difficult to see that $(?t.) coincides 
with the probability rate at which the subordinator passes through the level e n - n from any 
state St = s < e n:n . The following lemma extends this observation. 

Lemma 3.1. Fort > 0, conditionally given St = s with s G (e n:m+ i, e n:m ) the subordinator 
is passing through e n:m at rate $(m) ; and is hitting at this passage each of the intervals 
(e n:m _ fc+ i, tn-.m-k) at rate ^> m)k , for 1 < k < m < n. 



{S t )t>o with the 
(6) 
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Proof. The proof exploits the Levy-Khintchine formula (jH]) and the memoryless property 
of the exponential distribution. See computations around [T3J Theorem 5.2] for details. 

□ 

Now suppose that each of the primary clusters 1 2 ... n is given an exponential mark 
ei, . . . , e n , and that for every t > the marks €j > S t are associated with primary clusters 
j existing at time t. If t is a jump-time of 5* and the interval {St-, S t ] covers exactly one 
mark ej, we interpret the event of passage through ej as turning the primary cluster j into 
secondary. If (S t -, S t ] covers at least two of the e/s, we interpret this event as a collision 
which takes the corresponding primary clusters. Setting N*(t) := #{j G [n] : 6j > S t } 
we obtain a process with desired rates ip m ,k for transition from m to m — k, as it follows 
from the lemma. In particular, $(n) = Ylk=i fn,k coincides with the total transition rate 
of the coalescent Il n from the initial state 1 2 • • • n. 

A regenerative ordered partition of the set [n] is defined by sending i,j to the same 
block iff T ei = T e ., see [13]. The number of blocks of the partition is equal to the number 
of jumps of N* prior to the absorption at state 0. 

These evolutions of primary clusters are consistent in n. Assigning the exponential 
marks ei,e2, . . . to infinitely many primary clusters 12... defines the initial state of 
the dust component. The frequency of the dust component of Iloo as time passes is the 
decaying process (exp(— S t ))t>o- 

One straightforward application of the representation by 5* concerns r*, the maximal 
lifetime of primary clusters in II n . Let 

T s := mf{t > : S t > s} 

be the first passage time through level s > 0. We can identify r* with T £n l , hence connect 
the limit behaviour of r* to that of T s for high levels s. Indeed, from the extreme-value 
theory it is known that e n:1 — \ogn converges in distribution, as n — > oo, to a random 
variable with the Gumbel distribution. It is also known that the scaled and centered 
random variables (T s — g(s))/f(s) can converge in distribution only if the normalizing 
constant f(s) goes to oo with s. Thus, T en:1 and T logn have the same limit law, if any. 
Moreover, it can be shown that (T s — g(s))/f(s) converges weakly to a given proper and 
nondegenerate probability law if and only if the same holds for {T' s — g(s))/ f(s), where 
T' s is the number of points within [0, s] of a random walk which starts at and has the 
generic step distributed like Si. See [5] (or Proposition 27 in [22]) for a complete list of 
limit distributions of T' s and the conditions of convergence. Summarizing the above, we 
have 

Proposition 3.2. For constants a n > and b n G 1R, if one of the random variables 
(t* — b n )/a n and (T logn — b n )/a n converges weakly, as n — > oo, to a nondegenerate proper 
distribution, then the other random variable converges weakly to this distribution too. 

To realize the full dynamics of II n in terms of the level-passage, a mark is assigned 
to each cluster according to the following rule. At time the marks ei, . . . , e n represent 
the primary clusters 1 2 • • • n. At time t > there is some collection of marks on [St, oo) 
representing the clusters existing at this time. If at time t > the subordinator passes 
through exactly k marks corresponding to some clusters Ii,...,Ik C [n], then a new 
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cluster Ix U • - • U If. is born and assigned a mark S t + e, where e is a copy of the unit 
exponential random variable, independent of S and all other marks assigned before t. For 
instance, if at the first passage time t = T tn . n the subordinator jumps through exactly 
k levels e^, . . . , e Jfe out of e±, . . . , e n , then the secondary cluster J = . . . , j k } is born 
(which is a singleton if k = 1) and assigned a mark exponentially distributed on [S t , oo). 

In particular, when S passes at some time t through only one mark, there is no change 
in n„(t), and the mark of the corresponding singleton cluster is just re- assigned. 



4 The absorption time 

We wish to exploit the lifetime r* of primary clusters as approximation to the absorption 
time T n . At time r* the coalescent process is left with R n residual clusters, whence the 
distributional identity 

r :=0, Tn = T* + f Rn , fi6N, (7) 

where f m is assumed independent of (r*,R n ) and distributed like r m , for each m G No- 
To address the quality of approximation we need to estimate R n . 

We begin with some preparatory work. By the first transition the Markov chain N* 
goes from n to a state with distribution := ^„ jn _ fc /$(n), < k < n — 1. Let g n ^ be 
the probability that N* ever visits state k, so in terms of the realization via subordinator, 
9n,h = P(^e n:fc+ i < T tn . k ) is the probability that the interval [e n ±+i, t-n-.k] intersects the 
range of S. An explicit formula for g n ^ in terms of $ is available (see [13], Equation 
(50)), but it is complicated and inconvenient for computations. 



Lemma 4.1. Suppose {rk)k&n is a nonnegative sequence such that the sequence 
is nonincreasing . Then the sequence (a n )„ e pj defined by 



k 



fceN 



a = 0, a n :=^2g njk r k , n>l 



fc=i 



satisfies the relation 



n — y oo. 



k=l 



Proof. The assertion follows from Lemma 16.11 in the Appendix. Indeed, conditioning on 
the size of the first jump of N* we see that the sequence (a n ) satisfies the recurrence 



n—l 



a = 0, a n = r n + ^2p ntk a k , n E N. 



fc=0 



To apply Lemma [6. II we take ip n = $(n). Condition (C2) holds by the assumptions and 
condition (CI) follows from 



n ^ n—l 1 ™ 

$(n) V"(l - k/n)p n:k = - y^(n - k)ip n ^ k = - k(p n , k = / xv(dx) > 0. 
^— ' n n *— ' /n 

fe=0 fc=0 fc=l ,yu 



□ 
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Note that, since the function s i— >■ $(s)/s is nonincreasing, the sequence f^y^J is 

nonincreasing whenever (r^) is itself nonincreasing. 
Denote u(x) := u([x, 1]), x G (0, 1). 

Lemma 4.2. // either of two equivalent conditions 

/ x _1 dx / v(y)dy < oo, (9) 
Jo Jo 



fe=i 

ZioZc/s then 



k 2 

k=l 



< oo (10) 



ER n = 0(1), n -)> oo, 
m which case the sequence of distributions of the R n 's is tight. 

Proof The equivalence of ([2]) and (fTUj) is established by repeated integration by parts. 

In the genealogical history of each residual cluster there is the last secondary cluster 
appearing as a result of collision or switch involving some primary clusters. If secondary 
cluster b is born at some time t < r* of such an event, and if at this time some j > other 
primary clusters co-exist, then b corresponds to a residual cluster provided that b and its 
followers do not collide with these j primary clusters or their followers before time r*. 
That is to say, b and the j primary clusters belong to distinct branches if the coalescent 
tree is cut at time r*. Let qj be the probability that such cluster b corresponds to a residual 
cluster; restricting the coalescent to j + 1 clusters it is seen that qj indeed depends only 
on j. The consistency property of the coalescent with respect to the restrictions entails 
that qj is decreasing in j. Averaging over the times when some primary clusters engaged 
we find the expected number of residual clusters 

n-l 

ERn = y^ffn,jgj- (11) 

Furthermore, given St = s, we have exactly j exponential marks of the primary clusters 
larger than s. The cluster b is assigned a new exponential mark u = s + e which lies 
within each of the spacings in (s, oo) generated by e n -j, . . . , e n: i with the same probability 
+ 1). If this spacing is (e n: fc + i, e n ±) then b may correspond to a residual cluster 
only if (i) T 6n , k+1 < T u < T tn , k and (ii) b does not collide further with k primary clusters 
and their followers before time r*. If (i) occurs, condition (ii) is not sufficient for the 
correspondence since possible collisions with some of j primary clusters or their followers 
are ignored. This leads to the inequality 

1 

Qj < — y 2^ 9j+i,k+iPk+i,k9k, 1 < J < n - 1, 

3 k=0 

and q = 1. Substituting (p kl = k($(h) — — 1)) we obtain 
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1 ^ fc($(Jfe) - $(k - 1)) 
* - 7+i^ Pi+Wk ^ ^ 



r X)(^(*)-^(*-l))9*- 



where Lemma 14.11 was applied with 



$(fc) 



The required monotonicity condition holds since both and $(&) — $(/c— 1) are decreasing 
in k, the latter by concavity of $. Here and throughout c will denote a positive constant 
whose value is not important and may change from line to line. 

Setting dj = (j + l)qj and bj = c($(j + 1) — $(j))/ (j + 1), we obtain from the above 



a 3 



fc=0 



We want to show that the sequence (a,-) is bounded. To that end, let Mj := max^o,...^ a«, 
then also 

j 

k=0 

Since decreases we have <&(j + 1) — < $(j + 1)/ (j + 1), which taken together 

with (TlOl) implies that the series J^^l converges, so we can choose 

oo 

n := mi{k > : < 1/2}. 

If lim^oo M n = oo then 

1 < hminf ELo&fcMfc = hminf EL "° * fcMfc <Vfe t < 1/2, 

rt-s>oo M„ n->oo j\/f n ^— ' 

fc=no 

which is an obvious contradiction. Therefore (a n ) is bounded. From this 

?i < Mj/(J + 1) < c/j. 

Substituting this bound into (fTTj) and applying Lemma 14.11 leads to the conclusion that 
~ER n remains bounded, as n — > oo, by the virtue of ([TO]) . □ 



Recall that the convergence of T s in distribution always requires a scaling constant 
going to oo as s — >■ oo. Under conditions of Lemma 14.21 the sequence of laws of tr u is 
tight. Now from Proposition 13.21 and the decomposition (|7j) the following main result of 
this section emerges. 
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Theorem 4.3. Suppose holds. For some constants a n > and b n G R, if one of the 
variables (T\ ogn — b n )/a n and ij n — b n )/a n converges weakly, as n — )■ oo, to a nondegenerate 
proper distribution then the other variable converges weakly to the same distribution. 

The value of this result lies in the fact that the limit laws for T s and the conditions 
of convergence are immediately translated into the convergence of r n . Normalizing and 
centering constants are known explicitly, see Proposition 27 in [22] or 0. It follows that 
only stable laws and the Mittag-Leffler laws can appear as the limit distributions of r n . 

If measure v is finite the condition obviously holds. In this case S is a compound 
Poisson process. Theorem 14.31 has been proved [TT] under the assumptions that v is not 
supported by a geometric sequence (1 — x k )t>o (meaning that the law of Si is nonlattice) 
and that 

9 :— / | logx| v(dx) < oo. (12) 



o 



Theorem 14.31 shows that the result of jTT] is still true without requiring ffl~2"]) . 

Assumption (Q is not very restrictive since $(&;) = o(k), k — » oo, always holds. 
Concretely, suppose the right tail of v has the property of regular variation at 0, that is 

u(x) ~ x-^£(l/x), x |0, (13) 

for some function £ of slow variation at oo, and 7 G [0, 1]. Then condition (Q is satisfied 
for 7 G [0, 1). In the edge case 7 = 1 the behaviour of I is important, for instance 
holds for £(y) = (\ogy)~ 5 if 5 > 2 and does not hold if 5 G (1, 2]. 

We use condition (jUj) to bound R n , although we perceive that Qj can be omitted 
and the equivalence in Theorem 14.31 holds in full generality for the coalescents with dust 
component. Note that (J9]) is the local property of z7 near 0. More substantially, the limit 
law is affected by the decay at 00 of the right tail of the distribution of Si, for which the 
behaviour of v near 1 is responsible. We illustrate this by two examples. 

Example: normal limits. Assume in addition to ([9]) that 

s 2 := Var(Si) = f | log(l - x)\ 2 u(dx) < 00. 
Jo 

Then as n — > 00 

T n — m^ 1 logn d . r, , s 

(m ^ logn) 1 ^ 

where m := ESi = J \ log(l — x)\u(dx). 

This setting applies to beta coalescents mentioned in Introduction. We choose the 
constant in ([3]) to be c = 1/B(a, 6), where B is the beta function. The case a > 2 was 
settled in [11]. We focus on the previously open case 1 < a < 2. 

For a = 2 we compute the constants as 

m = b{b + 1)C(2, b), s 2 = 26(6 + 1)C(3, 6), 
where ( is the Hurwitz zeta function. 
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For a G (1, 2) we have 

a + b-1 



m 



(a- l)(2-a) 



1 - (a + b - 2){^(a + 6 - 1) - #(&)} 



9 a + 6 — 1 
s = — r X 



(a - l)(2-a) 

(^2{#(a + 6 - 1) - *(&)} - (a + 6 - 2){(*(a + 6 - 1) - ^(6)) 2 + *'(&) - *'(a + 6 - 1)}^ 

where \I/ is the logarithmic derivative of the gamma function. Finally, condition 
holds since (|T3|) is satisfied with 7 = 2 — a G [0, 1) and constant function t. Therefore, 
convergence (TL4")) holds with the computed m and s. 

Example: stable limits. Assume and 

v(l-e- y )~y-PL(y), y -> 00, (15) 
for some function L slowly varying at 00 and /3 G (1,2). Then 

— > S{p), n — > 00, (16) 



m-^+i)/^ CLlognJ 

where c n is any sequence satisfying lim n ^. 00 nL(c n )/c^ = 1, and «S(/3) is the /3-stable 
distribution with characteristic function 

z 1 — y exp{-\zfT{l -/3)(cos(tt/3/2) +isin(7r/3/2)sgn(^))}, 2 G R. 

To illustrate, consider 

x a ~ 2 dx 



v(dx) 



x) \ loefl — x) 



where d G (2,3) and a G (d, d+ 1). Then (j2J) is satisfied, and condition (1131) holds with 
7 = d+ 1 - a G (0, 1) which implies ©. Condition ([15]) is fulfilled with /9 = d- 1 G (1, 2). 
Therefore, the absorption time r n of such coalescent has limit law ffTB"]) . 



5 The number of collisions 

5.1 Preliminaries 

As an approximation to the number of collisions X n we shall consider X*, the number 
of jumps of N* of size at least two. We will not be able to derive a complete result 
comparable with Theorems 15.31 or 14.31 because the universal criterion for convergence of 
X* is not available. The cases when we know the behaviour of X* (from [IJJ [15], [1] and 
[TO] ) are all covered by the assumption that v satisfies (1131) . We shall also proceed in this 
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direction but exclude the case 7 = 1 when K n> \ is the term of dominating growth in the 
sum K n = Y2r=i Kn,r- By Karamata's Tauberian theorem [6] condition (Tl3|) with 7 < 1 
is equivalent to the analogous asymptotics of the Laplace exponent 

$(z) ~ T(l - -?)z J £(z), z^oo. 

The case of finite v appears when 7 = and $ is an increasing bounded function. 
The sequence (X n ) is nondecreasing and satisfies a distributional recurrence 

X 1 = 0, X n = X n _ Jn+1 + l(J n >2), n>2, (17) 

where in the right-hand side J n is independent of the AYs, Xi = Xi, and J n is distributed 
like the first decrement of N*, that is P(J n = k) = p n ,n-k for 1 < k < n. Similarly, the 
number X* of collisions which involve at least two primary clusters satisfies 

X* = 0, X* n ±X* n _ Jn + l(J n >2), n>2, (18) 
with the convention Xq = 0. We may decompose X n as 

X n = XZ + D n ®K n -K n>1 + D n a.s. (19) 

where D n is the number of collisions which take at most one primary cluster. Thus a 
collision contributes to D n if either exactly one primary cluster merges with at least one 
secondary cluster, or at least two secondary and no primary clusters are merged. 

Lemma 5.1. We have 

n 

ED n <cJ2(H k )/ k )\ ^N. (20) 

k=l 

In particular, if either of two equivalent conditions 

00 

(m/kf < 00 (22) 

k=l 

holds then the sequence of distributions of the D n 's is tight. 

Proof. The equivalence of (I2T1) and (|22p follows from [31 Proposition 1.4]. 

Choose some primary cluster b, to be definite let it be the cluster labelled 1, and 
suppose X n _i is realised as the number of collisions among n — 1 primary clusters [n] \ {b} 
and their followers. Then X n = X n _ 1 + Z n , where Z n is the indicator of the event that the 
first collision of b involves exactly one other cluster a. At the time of the merge of b with 
a the Markov chain N* decrements by two or one, depending on whether a is primary or 
secondary. Let Y n be the indicator of the event that the first involvement of b is either 
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turning b into secondary cluster, or a collision taking at most one other primary cluster 
and arbitrary number of secondary clusters. Clearly, Y n > Z n , therefore from ( TT7|) 

X n < X n _ Jn + F n _ Jn+1 + l(J n > 2), (23) 

d 

where < stands for 'stochastically smaller'. Passing to expectations in (|23|) . (fl8|) and ( TT9|) 
we see that, for d n := KD n , y n := EY n , 

n-1 

di = 0, d n < y^p n ,fc(4 + Vk+i), n = 2,3,..., 
k=i 

and iterating this inequality yields 

n-1 

di = 0, d n < gnjVj+i, n = 2, 3, . . . 

By exchangeability, we have y n = (EK" njl + 2EK n>2 )/n. Since 

m jv- V- V- fc(^(fc)-^(fc-l)) 

JH.K njl = 9n,kPk,k-l = 2_^9n,k |~^T , 

fc=l fe=l ^ ' 

using Lemma [47T1 with = /c($(/c) — — l))/$(k) yields 

EK nA < c$(n), nGN. (24) 

Using this, an inequality shown in Appendix and the monotonicity of $, 

IpjQt pit 

^„,2 < ciEA- rn/2l)1 < c 2 $(\n/2]) <c 2 $(n). 

Thus 

n 

dn < C 7,pn,fc^(fe)/fc, 
fc=l 

and using Lemma [4.11 with = c<&(k)/k results in (TSUI) . □ 
5.2 The compound Poisson case 

Assume that v is a finite measure on (0,1), not supported by a geometric sequence of 
the form (1 — x k )k>o, for some x G (0, 1). Since a linear time change of the coalescent 
does not affect the distribution of X n we will not lose generality by assuming that v is a 
probability measure on (0, 1). Let (Wk)ken be independent copies of a random variable 
W such that the law of 1 — W is v. The subordinator S is then a unit rate compound 
Poisson process with the generic jump | log W| having some nonlattice law. 

The variable K n>r introduced on p. [5] can be identified with the number of parts of 
size r in the regenerative composition of [n] associated with S. Alternatively, K niT has 
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interpretation in terms of the following occupancy model (see e.g. [ID])- Consider a 
random discrete distribution 

P k :=W 1 ...W k . 1 (l-W k ), fc£N, 

with Pfc thought of as a frequency of box k. Suppose n balls are thrown in infinitely 
many boxes, independently given {P k ), with probability Pk of falling in box k for each 
ball. Then K n<r can be identified with the number of boxes occupied by exactly k out of 
n balls. 

Introduce ^ 

m := / | log(l — x)\u(dx) 
Jo 

and for 1 < r < n let x„ ir := KK n , r . 
Proposition 5.2. 

(a) // m < oo then for every r = 1,2,... the vector (K n> x, K nt 2, ■ ■ ■ , K n>r ) converges 
weakly, as n — > oo, to a proper multivariate distribution, and x n>r — > (mr) _1 . 

(b) If m = oo then x njr — > 0, so K n ^ r — > in probability. 

Proof. Part (a) was proved in [12], Theorem 3.3. 

For (b) consider a random walk (Qj)j>o with Q = and the generic step | log W | . 
Then Pj = (1 — Wj) exp(— Qj-i). Using 1 — x < e~ x with x G [0, 1], and substituting e z 
for n, we reduce estimating x„ x = nJ2j>\ Pj{\ ~ Pj) n ~ l to estimating 



E ( e z Pje- eZp i j = E | ^(1 - Wj) exp{z - Qj_ x - e z ~ Q ^(l - Wj)} 
\i>i / \i>i 

f(z-y)dU(y), 



o 



where f(y) := E{(1 - W)exp(y - e^(l - W))} and U(y) := Ej>oHQj < v) is the 
renewal function of the random walk. The function / is nonnegative and integrable, 
since f(y)dy = 1. Furthermore, the function y — > e~ y f(y) is nonincreasing. It is 
known that these properties together ensure that / is directly Riemann integrable (see, 
for instance, the proof of Corollary 2.17 in [8J). When m = E| log Wj\ = oo, application of 
the key renewal theorem yields J °° f{z — y)U(dy) — > 0, as z — > oo, whence x n l — > 0. 

For r > 1 the argument is similar, or one can use the estimate x n r < c r x n l shown in 
Appendix, Lemma [6.21 □ 



The next theorem improves upon a result from [TT] by removing condition ( 1121) . 
Theorem 5.3. For constants a n > such that lim a n = oo, and b n G E, whenever any 

n— >oo 

of the variables 

K n — b n X* — b n X n — b n 
, or 

a n a n a n 

converges weakly, as n — >■ oo, to a nondegenerate proper distribution then all three vari- 
ables converge weakly to this distribution. 
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Proof. Recall representation ( TiTjj) . Since v is a probability measure we have < 1, 

hence condition (1221) is satisfied, and the sequence of laws of the D n 's is tight by Lemma 
15.11 By Proposition 15. 2\ the sequence of laws of the K nj iS is tight as well. By the 
assumption a n — > oo the result follows. □ 

From [10] it is known that, depending on the behaviour of V(x) near x — 1 there are 
five different modes of the weak convergence of, suitably normalized and centered, K n . 
We do not exhibit all these cases here, rather provide an example borrowed from [JO] to 
demonstrate a substantial role of the parameter 9 = log a; |v (da;). 

Example. Suppose v has the right tail of the form 

, |logx| p 

*(*) = 1 m ' ^e(0,i] 



with p > 0. In the case p G (0, 1/2) we have = oo, and 
X n -m- 1 logn + (m(l-p))~ 1 log 1 ^n d 



dog 1 / 2 



-> A/"(0, 1), n ->■ oo, 



where m = J | log(l — x)| z/(dx). 

In the other case, when p > 1/2 (then < oo for p > 1), the centering simplifies, so 
that 

X n - rrT 1 \ogn d lN 

— ^ ^AT(0,1), n^oo. 

clog ' n 



Evolution of secondary particles. In the compound Poisson case the number V t of 
secondary clusters of n oo (t) is finite, for each t > 0. The process V = (V t ) t >o starts with 
Vo = and is a Markov chain with the transition rate <p m ,fc = (T) ^m,k for jumping from m 
to m — k + 1, < k < m, k ^ 1. The rate for k = is given by the same formula (JTJ, and 
y? TO| Q < oo because ^ is finite. The k = transition, resulting in increase of the number of 
secondary clusters by one, occurs when some (in fact, infinitely many) primary clusters 
merge without engagement of secondary clusters. The stationarity of V is a consequence 
of the existence of the dust component with infinitely many clusters. 

It can be shown that the Markov chain V is positively recurrent and has a unique 
stationary distribution (7r m ) found from the balance equation 

oo 

T^m — ^m+k-lfm+k-l,k (25) 

supplemented by the conditions n = and J2m=i = 1- 

Suppose for example that u(dx) = dx is the Lebesgue measure on [0, 1]. In this case 
(p m ,k — (rn + 1) _1 . Equation f[2"5]) becomes 7i m = Y^jL m -\ ^j/ti + ■*■)• Differencing yields 
T^m — ^m+i = ^m-i/ m i which is readily solved as 

e" 1 

TTm = t ttt, m = 1,2,... 

[m — 1)! 
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so in this case the stationary distribution is shifted Poisson. 

In contrast, the number of secondary clusters in the finite coalescent Il n is not a 
Markov process, because the transition rates depend on the number of remaining primary 
particles. 



5.3 The case of slow variation 

Suppose f|T3|) holds with 7 = and slowly varying £(z) — > 00, z — > 00. The Laplace ex- 
ponent satisfies then <3>(z) ~ £(z). Suppose also that the subordinator has finite moments 

m = ES 1 = [ |log(l -x)\u(dx), s 2 = VarSi=/" | log(l - x)\ 2 u{dx). 
Jo Jo 

Choose the centering/scaling constants as 



m Jo z V m ' ^0 

In [TJ it was shown that for n — > 00 

EK n ~ b n , \/NmK n ~ a n , 

and that the normal limit (K n — b n )/a n -4 A/"(0, 1) holds for various classes of functions 
t. In particular, this includes functions of slow variation at infinity with asymptotics as 
diverse as 

^) = log(log(...(log(z))...)), £(z)=log^z, £(z) = exp(\ogPz), 
where /3 > 0. 

The series f )22|) converges for arbitrary £, hence by Lemma [5.11 KD„ = 0(1). On the 
other hand, from fl2l]) and by the properties of slowly varying functions [6] 

EK n>1 = 0($(n)) = o(a n ). 

It is immediate now that (K n — b n )/a n -4 A/"(0, 1) implies both (X* — b n )/a n -A Af(0, 1) 
and (X n - b n )/a n 4 W(0, 1). 

Example: gamma subordinators. Consider the classical gamma subordinator with 
Laplace exponent <&(z) = alog(l + z/(3), where a, (3 > 0. The corresponding v driving 
the coalescent has density 

I log(l - x)\ 

The central limit theorem for K n was proved by different methods in [T3] and pp. From 

this we conclude that the number of collisions also satisfies (X n — b n )/a n A/"(0, 1), where 
the constants can be chosen as 



' /3 log 3 n /3 log 2 n 

; b n = 7. • 
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Example: beta(2, 6)-coalescents. For this family u(dx) = x 1 (1 — x) b 1 dx. The con- 
vergence of X n to the standard normal distribution holds with scaling/ centering constants 

/ s 2 ~ log 2 n 

a " = V^ logn ' bn = ^ 

where m = C(2,6), s 2 = 2C(3,6). 



5.4 Regular variation with index < 7 < 1. 

A key distribution in this case is the law of the random variable 

exp(-7£ t )dt, 



known as the exponential functional of the subordinator ^yS. The distribution of I is 
uniquely determined by the moments 

fc! 



Er 



nil *(70 



From [15] (Theorem 4.1 and Corollary 5.2) X*/a n — > I, where a n = T(2 — 7)n 7 £(n), and 
no centering is required. In fact, K n /a n and K n , r ja n (r > 1) converge almost surely and 
in the mean. 

To justify the convergence of X n using ( |T9i) we need to etimate ED n . For < 7 < 1/2 
we have E,D n = 0(1) since $(-2) ~ c£(z)z 1 , hence the series (|22|) converges. For 1/2 < 
7 < 1 we have 

n 

J2(Hk)/k) 2 ~ cn ^-H 2 (n), 

k=l 

and for 7 = 1/2 the latter sum, as a function of n, has the property of slow variation at 
infinity (see [Sj, Proposition 1.5.8). Thus in any case D n /a n — > in probability. It follows 

that X n /a n -4 /. 

Example: beta(a, 6)-coalescents with 1 < a < 2. In this case 

~^ 7^T — vFTTT / exp{-(2 - a)S t } dt, n -)■ 00. 



n 2 - a (2 - a)T(b) 



This result was obtained in [18] by another method, and with a change of variables the 
equivalence with Theorem 7.1 from [21] in the case 6 = 1 can be established. 

The subfamily of beta-coalescents with parameters b = 2 — a was intensively studied. 
In the literature sometimes a := 2 — a is taken as parameter, so that v in this notation 
becomes 

v{dx) =x- a - 1 (l-x) a - 1 . 

In this case N* decrements like a random walk conditioned to hit and, moreover, there 
is an explicit formula (see [13J p. 471) 

(a) k (a) n -k ( n N 



9n,k i , 1 , 

(a n \k 
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where denotes the rising factorial. The variable K n is then the number of blocks 
in Pitman's (a, a)-partition (or in the regenerative composition induced by excursions of 
a Bessel bridge P3]). We refer to [24] and [2] for further multiple connections of these 
beta-coalescents to various random processes. 

6 Appendix 

A linear recursion. For each n G N, let {p n ,k)o<k<n be a probability distribution with 
Pn t n < 1. Define a sequence (a n ) n£ N as a (unique) solution to the recursion 

n 

a n = rn+Z^Pn^ak, neN, (26) 

k=0 

with given r n > and the initial value ao = a > 0. 

Lemma 6.1. Suppose there exists a sequence (ipn)neN such that 

(CI) liminf n _ Kx> V>n ELo( 1 ~ k / n )Pn,k > 0, 

(C2) the sequence (VVfc/fc)&eN is non-increasing. 

Then (a n ) defined by (126]) satisfies 

k=l 

In particular, (a n ) is bounded if the series J^feLi conver S es - 
Proof. Write for simplicity pk for p n ^ and let ti^ = Ylj=oPj- Using (C2) we have 

r k^k , r n ip n ^ ^ . 

}, ~l~' K k-i > > 7T fc _i = r n ip n > (1 - j/n)pj. 

k=l k=l j=0 

By (CI) there exist n G N and c > such that 



a„ = 0(f:^), n^oo. (27) 



c^-^vr fc _i > r n , n>n . (28) 



fe=i 



From this, x n := c Xwb=i r kipk/k satisfies 

n 

XkPk , n > n (29) 



it=i 
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To check the latter, write 



n n 



"n + ^ XkPk = r « + C ^~^Pk 
k=l j=l k=j 3 

3=1 3 

j=i J i=i J 

r 3^3. 



; TTj — 1 — %n ■ 

3=1 3 

Set Xq := 0. Subtracting ( 126|) from ( 1291) we see that y ra := x n + Cq — a n satisfies y n > 
TUt^PkUk for n > n and arbitrary cq. We can achieve that the recursion for y n holds 
for all n G N by choosing Co > max n < no a n . But then it is easily shown by induction that 
Un > for all n6N, which implies the desired estimate of a n . □ 



Estimates for the occupancy counts. Let {pk)km De a probability mass function. 
Consider the multinomial occupancy scheme in which n balls are thrown independently 
in boxes, with probability pj for box 1,2,... The expected number of boxes occupied by 
exactly r out of n balls is 



Lemma 6.2. For fixed r < s there exists a constant c such that 

x n ,r > oc 2n , s , neN. (30) 
Proof. Using (1 — x)^ 1 > e x for x G (0, 1), 

( n )x r (l - x) n ~ r s' 

[r) { ' > Cl ^-(nxy- s (l-x) s - r ~ n 



( 2 ^)x s (l-x) 2n - s ~ 2 s r\ 

> c 2 (nx) r - s e nx/2 

> c 2 mmy r ~ s e y/2 

y>0 

= C 2 



2(s-r) 



□ 



The result extends immediately to the case of random (Pk)- This generalization was 
used in the proof of Proposition 15.21 with the P/s being the sizes of intervals obtained by 
splitting [0, 1] at points of the range of the process exp(— S). 

Acknowledgement A. Iksanov gratefully acknowledges the support by a grant from the 
Utrecht University. 
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